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Abstract. A (1 + e)-approximate distance oracle for a graph is a data structure that supports approx- 
imate point-to-point shortest-path-distance queries. The most relevant measures for a distance-oracle 
construction are: space, query time, and preprocessing time. 

There are strong distance-oracle constructions known for planar graphs (Thorup, JACM'04) and, subse- 
quently, minor-excluded graphs (Abraham and Gavoille, PODC'06) . However, these require J?(e~^n Ig n) 
space for n-node graphs. 

We argue that a very low space requirement is essential. Since modern computer architectures involve 
hierarchical memory (caches, primary memory, secondary memory), a high memory requirement in 
effect may greatly increase the actual running time. Moreover, we would like data structures that can 
be deployed on small mobile devices, such as handhelds, which have relatively small primary memory. 
In this paper, for planar graphs, bounded-genus graphs, and minor-excluded graphs we give distance- 
oracle constructions that require only 0(n) space. The big O hides only a fixed constant, independent of 
e and independent of genus or size of an excluded minor. The preprocessing times for our distance oracle 
are also faster than those for the previously known constructions. For planar graphs, the preprocessing 
time is O(nlg^n). However, our constructions have slower query times. For planar graphs, the query 
time is 0(e~^ Ig^ n). 

For bounded-genus graphs, there was previously no distance-oracle construction known other than the 
one implied by the minor-excluded construction, for which the constant is enormous and the prepro- 
cessing time is a high-degree polynomial. In our result, the query time is 0(e~^(lgn -I- g)^) and the 
preprocessing time is 0{n{lgn){g^ + Ign)). 

For all these linear-space results, we can in fact ensure, for any S > 0, that the space required is only 
1 + S times the space required just to represent the graph itself. 



* An extended abstract is to appear in the Proceedings of the 38th International Colloquium on Automata, Languages 
and Programming (ICALP 2011) 



1 Introduction 



A {l-\-e) -approximate distance oracle for a graph is a data structure that supports point-to-point approximate 
distance queries. A distance-oracle construction for a family of graphs has three complexity measures: 

— preprocessing time: time to build the data structure, 

— space: how much space is occupied by the data structure, and 

— query time: how long does it take for a query to be answered. 

Each of these quantities might depend on the stretch parameter 1 + e (which is defined as the maximum 
ratio over all pairs of nodes of the query output divided by the length of a shortest path) as well as the size 
of the graph. 

General graphs For general graphs, for stretch less than 2, no approximate distance oracle is known that 
achieves subquadratic space and sublinear query time. (In Section 2.1, we briefly survey work on general 
graphs.) 

Restricted graph families The only known constructions that achieve (1 -I- e) stretch are for restricted fam- 
ilies of graphs: planar graphs [Tho04], minor-excluded graphs [AGOG], and graphs of low doubling dimen- 
sion [Tal04,HPM06,Sli07,BGK+10]. (In Section 2.2, we survey previously known results in this area.) 

Fortunately, such graphs arise in applications, e.g. relating to road maps. Distance oracles can be used 
in finding nearby points of interest, in navigation and route-planning, and in algorithms for solving other 
optimization problems such as vehicle routing. 

Space requirements One obstacle to the widespread adoption of this technique may have been the space 
requirements of known distance oracles. Even the most compact distance oracle of Thorup [Tho04] requires 
!7{e^^n\gn) space for n-node graphs. Even though the constant is quite modest, the storage required is 
rather large [MZ07]. 

Since modern computer architectures involve hierarchical memory (caches, primary memory, secondary 
memory), a high memory requirement in effect may greatly increase the actual query time. If the distance 
oracle could fit in cache, the query time could be much faster than if secondary or primary memory must be 
accessed. For smaller, less powerful mobile devices such as handhelds, the problem is exacerbated. 

Our contribution 

In this paper, for every family of graphs for which a nontrivial (1 -I- e)-approximate distance oracle is known 
(planar, bounded genus, 7?-minor-free, bounded doubling dimension), we give such a distance oracle that 
in addition requires only linear space.'* In fact, for any S > 0, there is such a distance oracle whose space 
requirement is only 1 + S times the space required just to store the graph itself; thus the overhead due to 
the distance oracle is in essence negligible. 

We achieve this while increasing the query time by a factor that is almost proportional to the decrease 
in space. ^ For planar graphs, the query time of our oracle is 0(e~^ Ig^ n). Although the query time for our 
constructions is slower than that for the superlinear-space constructions, the increase may be partly made 
up for by the decrease in actual time due to better memory performance (because of the memory hierarchy). 
The space/query-time tradeoff is tunable, so the construction can be adapted to a particular architecture. 

The preprocessing time is also faster for our schemes than for the superlinear-space constructions. 

* Our results for bounded- doubling- dimension graphs hold only for unit lengths. 

^ The product of space times query time for our oracle is 0{ne~^lg^n) while that same product is 0{ne~^ Ign) 
for [Tho04] 



2 



Bounded-genus graphs For bounded-genus graphs, there was previously no distance-oracle construction 
known other than that implied by the minor-excluded construction, for which the constant is enormous 
and the preprocessing time is a high-degree polynomial. We give a more efficient construction tailored to 
graphs of genus g. For our linear-space oracle, the query time is 0(e~^(lg n+g)^) and the preprocessing time is 
0(n(lg n){g^+\gn)). (There is also an alternative construction with preprocessing time 0(n(lg n)(g/e+\gn)).) 
We also provide an approximate distance oracle using space 0{ne~^{\gn + g)) but having faster query time 
0{g/e) (Theorem 3). 

Bounded-doubling-dimension graphs We also provide a linear-space approximate distance oracle for unit- 
length graphs with bounded doubling dimension (Theorem 5) using an approach that does not require 
separators. The query time of our construction is e^*^*^" ' • (Ign)*^*^"^. 

Summary of our results A summary of our results is given in Table 1. 



Graph Class 


Preprocessing 


Query 




Planar Undirected 
Planar Directed 

Reachability Oracle 
Genus g 
//-minor-free 
a-doubling, unit lengths 


0(n Ig^ n) 

0(n(lg(n7V))(lgn)\-2) 
0(n Ig n) 

0(n(lgn)(5^ + lgn)) 
0{poly{n, e)) 
e-Oi°:)o{poly{n)) 


0{{e-'{lgn)MnN))f) 
0(lg' n) 

0{e-'{lgn + gf) 
0{e-\\gnf) 
e-0("') • (Ign)O(-) 


Theorem 1 
Section 3.3 
Section 3.3 
Theorem 2 
Theorem 4 
Theorem 5 



Table 1. Time complexities of our linear-space (1 -I- e)-approximate distance oracles. A'^ denotes the largest integer 
weight. Due to space restrictions, some are in [KKSll]. 



2 Previous work on approximate distance oracles 

2.1 General and sparse graphs 

Thorup and Zwick [TZ05] gave asymptotically almost optimal trade-offs for distance oracles for general 
undirected graphs, proving that for any graph and for any integer k there is a {2k— l)-approximate distance 
oracle using space 0(fcn^+^/'^) and query time 0{k). They also prove that, if stretch strictly less than 2k + I 
is desired, then ]7(ri^+^/'^) bits of space are necessary. A slightly weaker lower bound holds for sparse graphs: 
Sommer, Verbin, and Yu [SVY09] prove that a distance oracle with stretch k and query time t requires space 
j^i+f2(i/(fct)) poly-logarithmic factors). Tight with respect to this bound, the distance oracle of Mendel 

and Naor [MN07] has query time 0(1) and stretch 0{k) using space 0(n^+^/'^). The oracle with the best 
stretch factor is by Patra§cu and Roditty [PRIO], who recently gave a 2-approximate distance oracle using 
space 0(n^/'^) on sparse graphs. Distance oracles with stretch strictly less than 2 have not been achieved for 
general graphs. 

2.2 Restricted graph classes: planar, excluded-minor, and bounded-doubling-dimension 
graphs 

For restricted classes of graphs, better distance oracles are known and stretch 1 -I- e can be achieved. 
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Planar graphs Thorup [Tho04] presents efficient (1 + e)-approxiniate distance oracles for planar digraphs. 
(There is a slight improvement [Kle05] to the preprocessing time for one case.) Table 2 lists these results. 

There are also many results on exact distance oracles for planar graphs. The best is that of Fakcharoenphol 
and Rao [FR06] and its subsequent improvements [Kle05,KMW10,MWN10] and variants [MS10,NuslO]. 
Faster per-query time can be achieved by using more space [Cab06,MS10]. However, all these results require 
polynomial (but sublinear) query time. There are also results on special cases of planar graphs and special 
kinds of queries [DPZ95,?,CX00,KK06]. 





Preprocessing 


Space 


Query 


Reference 


Directed 
Directed 
Directed 


0(n(lg(niV))(lgn)-*6-^) 
0(n(lgn)2(lg(niV))e-i) 
0(n(lgn + e-i)(lgn)(lg(niV))) 


0(n-6-^(lgn)(lg(n7V))) 
0(n-e-i(lgn)(lg(niV))) 
0(n-6-i(lgn)(lg(niV))) 


0(lglg(n7V)+6-^) 
0((lgn)(lglg(niV) +£-!)) 
0((lgn)(lglg(niV) +£-!)) 


[Tho04, Thm. 3.16] 
[Tho04, Prop. 3.14] 
[Kle05, Sec. 7] 


Undirected 
Undirected 


0(n(lgn)^e-^) 
0(n(lgn)2e-i) 


O (n-e-'lgn) 
O (n- e-Mgrz) 


0{e-^) 
O(e-Mgn) 


[Tho04, Thm. 3.19] 
[Tho04, Implicit] 



Table 2. Time and space complexities of (H-e)-approximate distance oracles for planar graphs on n nodes. A'^ denotes 
the largest integer weight. 



Excluded- minor graphs Abraham and Gavoille [AG06] extend Thorup's result to minor-free graphs. After a 
polynomial-time preprocessing step, point-to-point queries can be answered in time 0(e~^ Ig^i) using a data 
structure of size 0{ne^^ Ig?^)- 

Bounded-tree-width graphs For digraphs with tree-width w, Chaudhuri and Zaroliagis [CZOO] give a 0{w^n)- 
time algorithm to compute a distance oracle with query time 0{w^a{n)), where a{n) denotes the inverse 
Ackermann function. Gavoille et al. [GPPR04, Theorem 2.4] provide a distance oracle with space 0{n-w Ig^ n) 
and query time O(lgn). 

Graphs of bounded doubling dimension Let A denote the aspect ratio (diameter divided by minimum dis- 
tance) and let a = lg2 A denote the doubling dimension. Har-Peled and Mendel [HPM06] , improving upon 
earlier results by Talwar [Tal04] and Slivkins [Sli07], provide a (l-l-e)-approximate distance oracle (they term 
it compact representation scheme) using space (l/e)°(")n with query time 0{a). Bartal et al. [BGK+10], 
extending [HPM06], recently gave a distance oracle with constant query time, at the cost of increasing the 
space consumption. We shift the trade-off in the reverse direction, increasing the query time while reducing 
the space requirement to linear (see Section 7 in the appendix for details). 

3 Linear-space approximate distance oracle for planar graphs 

We prove our main theorem. The description of the improved preprocessing algorithm can be found in its 
own section (Section 4). 

Theorem 1. For any undirected planar graph G with non-negative edge weights there exists a (1 -|- e)- 
approximate distance oracle with query time 0{e~^ n), linear space, and preprocessing time O(nlg^n). 

3.1 Review^ of Thorup's distance oracle 

We briefly review a variant of Thorup's distance oracle for undirected graphs (using somewhat different 
terminology).® 

This variant does not appear in [Tho04] but is an obvious simplification, analogous to that of [Tho04, Proposi- 
tion 3.14] (which applies to directed graphs) resulting in slower query times. Since our query time is slower anyway, 
this simplified variant suffices. 
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There are two core ideas. The first is approximately representing shortest paths that intersect a shortest 
path. Let P be a shortest path in a graph G. A pair (p, v) of nodes where p is in P and v is in G is a 
connection for v with respect to P. A set C of such connections covers v in G with respect to P if, for every 
node p of P, there is a connection (p' , v) in C such that 

dist(p,p') + dist(p', w) < (1 + e) dist(p, v) (1) 

Let u, V be nodes of the input graph. Let Q be the shortest u-to-v path that intersects P. Suppose C is a set 
of connections that covers u and v with respect to P. Then it contains connections (p, u), {p' , v) such that 

dist(u,p) + dist(p,p') + dist(p',w) < (1 + e)length(Q) (2) 

Thorup gives an algorithm that, given a (mostly) planar graph G and a shortest path P, computes a set C 
of connections that covers all nodes of G and that has 0(e~^) connections per node v. In Section 4, we give 
an algorithm that achieves a faster^ running time by covering only a subset of the nodes of G. The distance 
oracle involves storing with each node v the connections that cover v with respect to several shortest paths 
(and the distances associated with these connections). The storage required for v thus has size 0{e~^) times 
the number of such paths. 

The second idea is recursively decomposing a planar graph with shortest-path separators.^ This idea is 
based on a lemma in [LT79] stating that, for any spanning tree T in a planar graph in which every face is a 
triangle, there is a nontree edge e such that the unique simple cycle in T U {e} is a balanced separator. The 
nodes of this separator comprise two paths in T. 

The distance-oracle construction uses this lemma with T being a shortest-path tree to recursively decom- 
pose the input graph. The recursive decomposition defines a binary decomposition tree in which each node 
X is labeled by (i) a subgraph G{x) of the input graph and (ii) the separator S{x) used to decompose G{x), 
if X is not a leaf. If x is the root, G{x) is the input graph. If x has children y and z, removing the separator 
S{x) from G{x) results in two separated subgraphs, G{y) and G{z). If x is a leaf, G{x) consists of one node. 

Each input-graph node v is associated with some decomposition-tree node, namely, the leafmost node x 
whose subgraph includes v. We say that the ancestors of x are relevant to v. Thus each input-graph node v 
has O(lgn) relevant tree-nodes. The distance oracle assigns a label to v that consists of a set of connections; 
for each tree-node x relevant to v, for each of the two paths P comprising the separator S{x), the distance 
oracle stores a set of connections that cover v in G{x) with respect to P. It follows that the label of v has 
size 0{e~^ ^gn). 

Next we show that these labels suffice to estimate point-to-point distances. We say that a tree- node x is 
relevant to a path Q if S{x) contains a node of and is the most relevant if x is the rootmost relevant tree 
node. 

Lemma 1. If x is the tree-node most relevant to Q then G{x) contains Q. 

Let u, V be any pair of input-graph nodes, and let Q be the shortest u-to-v path. Let x be the tree-node most 
relevant to Q. Then G{x) contains Q, and at least one of the paths comprising the separator S{x), say P, 
intersects Q. It follows from (2) that the u-to-v distance is approximately 

dist(u,p) + dist(p,p') + dist(j3', v) (3) 

for two nodes p,p' on P. To estimate the u-to-v distance, therefore, the following procedure suffices: for every 
tree-node x that is relevant to u and w, compute the minimum of (3) over connections (p, u) and connections 
(p',w) where p and p' belong to one of the two paths comprising S{x). This takes time proportional to the 
number of such connections. We review this process in Section 3.2 since in our case the situation is slightly 
more complicated. 

' Our algorithm depends on G being wholly planar (as opposed to mostly planar as in [Tho04] , which is the case for 

the variant we address. 
* A similar but more involved such decomposition arose in [AGK"'"98]. 
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3.2 Our compact distance oracle 

Our linear-space construction draws on another kind of recursive decomposition using separators. 

Definition 1 (Frederickson [Pre87]). A division of a graph G is a partition of the edges of G into edge- 
induced subgraphs. A node of G is a boundary node of the partition if it belongs to more than one subgraph. 
An r-division of an n-node planar graph G is a division of G into 0{n/r) subgraphs, called regions, with 
the following properties: (i) Each region contains 0{r) edges, and (ii) the number of boundary nodes in each 
region is at most 0{y^). 

Note that there are 0{n/y/r) boundary nodes in total. 

Lemma 2 (Frederickson [Pre87]). A planar graph on n vertices can be divided into an r-division in 
O(nlgn) time. 

Using an i — division to obtain linear space Before carrying out the recursive decomposition with 
shortest-path separators, our preprocessing algorithm computes an r-division for r = P' (where ^ is a 
parameter). Subsequently, connections (v,w) are only stored for those nodes v that are boundary nodes of 
the 1 — division. Since there are 0{n/ y/r) boundary nodes, the connections and associated distances require 
storage 0{{nt~^ \gn)/y/r). We choose t = 0(e~^ Ign) so the total storage is 0{n). 

An s-to-t query is handled as follows. First, compute shortest-path distances from s to all the nodes in s's 
region Rg. This takes 0{P) time [HKRS97]. At this point, the query algorithm has distances in the subgraph 
Rs from s to all the boundary nodes of Rs (and to t, if t is in Rg). There are 0{t) such boundary nodes. 
Similarly, compute shortest-path distances to t from all the nodes in t's region i?t, obtaining distances in the 
subgraph Rt to t from all the boundary nodes of Rt- 

Let A,B be, respectively, the set of connections for boundary nodes of Rs,Rt. For each separator path 
P that has connections in A and B, the procedure described in Section 3.2 finds the shortest s-to-t path 
that enters P via a connection of A and leaves P via a connection of B. The time is linear in the number of 
such connections (see also [Tho04, Section 3.2.2] and [Tho04, Lemma 3.6]). Since each of the 0{i) boundary 
nodes of Rg and Rt has 0(e~^ Ign) connections, the total time for these computations is 0(£e~^ Ign). 

Finally, return the minimum overall path-length (including the s-to-t distance within Rg, it t belongs to 
Rg). The total time for handling the query is 0{i^ -\- £e^^ ^gn). 

Details of the query algorithm We now explain how we find the shortest s-to-t path that enters P via 
a connection of A and leaves P via a connection of B. The method is a generalization of that in [Tho04, 
Sections 3.2.1 and 3.2.2]. 

For each connection {b,p) in A, b is a. boundary node of Rg and we have dist(s,6). For each connection 
{b,p) in B, b is a. boundary node of Rt and we have dist(t, &). Let C be the sequence of all connections 
{s,p) and {t,p) in AU P, sorted according to the position of p on P. We use the following procedure. 

initialize mg, nit, d :— oo 
initialize p :— po 

for each connection (p, b) in C in order, 
rUg :— rUg -f dist{p,p) 
nit :— nit -\- dist{p,p) 
p:=p 

if & is a boundary node of Rg , 

rUg := min{ms, dist(s, b) + dist(6,p)} 
if 6 is a boundary node of Rt 

rat '■— min{r7ii, dist(t, b) + dist(&,p)} 
d :— minjd, nig -f mt} 
return d 
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The procedure requires time O (number of connections considered). The procedure maintains the invariant 
that, after a node p of P has been considered in the loop, is the length of the shortest s-to-p path of the 
form that goes via a boundary node b of Rs and a connection (b,p) and then travels along P from p to p, 
where p appears before p on P. A similar statement holds for nit- It follows that the value d returned by the 
procedure is the length of the shortest s-to-t path that travels in Rg to a boundary node b of Rg, then goes 
to P via a connection for 6, then travels along P then leaves P via a connection for a boundary node b' of 
Rt then travels to t within Rt- 

3.3 Extensions: distance oracles for planar digraphs and reachability oracles 

Similar techniques (i.e. storing the connections for a subset of the nodes only) apply to more of Thorup's 
results based on shortest path separators [Tho04]. Instead of using a more sophisticated preprocessing algo- 
rithm that computes only the connections for the boundary nodes, we may compute the connections for all 
the nodes (using Thorup's preprocessing algorithms as black boxes) and then store only those for boundary 
nodes. By doing so, we obtain the following linear-space oracles. For directed planar graphs, there is a (1-f e)- 
approximate distance oracle with query time 0((e~^(lgn)(lg(nAf)))^), where N denotes the largest integer 
weight. Furthermore, there is a reachability oracle with query time O(lg^n) (using [Tho04, Theorem 2.7]). 
We exploit a similar black-box construction for minor-free graphs in Section 6. 

4 Preprocessing algorithm for linear-space approximate distance oracle for 
planar graphs 

Thorup's preprocessing algorithm for his undirected construction takes time 0{ne^^ Ig^ n) (as stated in 
[Tho04, Theorem 3.19]). We give a preprocessing scheme for our construction that takes time O(nlg^n), 
independent of e. We give details later, but here we observe that the factor O(e^^lgn) speedup has three 
sources. (This explanation is aimed at readers familiar with Thorup's paper.) 

First, since we are not aiming for a query time of 0(e~^), we can use a simpler preprocessing approach than 
the one underlying [Tho04, Theorem 3.19]; we use the approach that for directed graphs underlies [Tho04, 
Proposition 3.14]. The corresponding bound for undirected graphs is listed in Table 2 as "implicit." 

Second, we only need to compute connections for a small subset of the nodes (the boundary nodes of 
the r-division). That in itself does not seem to permit an additional speedup using Thorup's method since 
his algorithm depends not on the number of connections stored but on the sizes of the graphs searched. 
Therefore, third, in addition we use another approach to finding connections, one based on the multiple- 
source shortest-path (MSSP) algorithm of Klein [Klc05] or that of Cabello and Chambers [CC07]. 

Preprocess(Go) 

let Bq be the set of boundary nodes of an r-division [Frc87] 
let T be a shortest-path tree 

compute recursive decomposition based on cycle separators of the form T U {e} 
for each nonroot node x of recursive-decomposition tree, 
for each path P, (i — 1, 2) comprising S{x), 

compute connections for nodes of Bq in G{x) with respect to P^ 

The last step, computing the connections for nodes of Bq in G{x) with respect to Pi, works on a graph 
G'{x) obtained from G{x) by cutting along P^, duplicating the nodes and edges of Pi and creating a new 
face whose boundary consists of the two copies of P^. This modification destroys paths that cross Pi but 
such paths are not needed since Pi is a shortest path. It has the advantage that, for each copy P oi Pi, in 
G'{x) all nodes of P lie on a common face. 

For each copy P, there is a computation that selects connections {p, v) for specified nodes v with re- 
spect to that copy. The computation uses an algorithm called Path(G, P,P) that takes time 0((1G] -I- 
number of connections) Ig \G\) and selects 0(e~^) connections per node v E B. Since there are two copies of 
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two paths comprising S{x), the last step of Preprocess selects 0(e~^) connections per node of Bq in G{x). 
Therefore the total number of connections for Bq is 0{e~^ ^^.n), and the total time is 0{n Ig^ n+ \Bo \ Ig n), 
which is O(nlg^n). 

Now we describe Path(G', B, P). Let the nodes of P be po ■ • - Ps- First the algorithm computes 
i{v) = argminjdist(pi, w) and = min^ dist(pi, w). 
These can be computed using a single-source shortest-path computation in the graph obtained by zeroing 
out the lengths of the edges of P. 

For i = 0, 1, . . . , s, let Ti denote the shortest-path tree rooted at pi. For i > 0, let T/ be the tree obtained 
from Ti-i by removing the parent edge of and adding the edge piPi-i, obtaining a Pi-rooted tree (not 
a shortest- path tree). For i > 0, let ct^ denote a sequence of edges whose insertion into T/ (followed by the 
ejection of each corresponding parent edge) result in T^. Klein [Kle05] shows that each edge is inserted 
at most once, and gives an 0(|G| Ig |G|) algorithm (the multiple-source shortest-path algorithm) to compute 
these sequences. For each such inserted edge uv, the algorithm also computes the resulting change Z\„^ in the 
length of the root-to-?; path in the tree. Cabello and Chambers [CC07] give a simplification of the multiple- 
source shortest-path algorithm and generalize it to bounded-genus in 0{g^ \G\ Ig |G|) time. The algorithm 
Path uses one of these algorithms to compute the sequences Ui and the corresponding length changes . 

4.1 The two phases 

The remainder of Path consists of two phases, Forward and Backward. A connection {pi,v) might be 
added by Forward if i > i{x) and by Backward if i < i{x). We describe Forward. Backward is 
symmetric. 

4.2 The Forward phase 

The algorithm Forward iterates through the nodes ■ • • jPs of P, maintaining a tree T that is, in turn. 

To, Ti, . . . , Ts. The tree T is represented using a dynamic-tree data structure [ABH+04,AHdLT05,Fre97,ST83,TW05]. 

A node-labeling is maintained: /i(w) is a quantity (discussed later) that is used to decide whether v needs a 

new connection. This labeling is represented implicitly, as is typical in dynamic trees, so as to support bulk 

updates. In this case (somewhat atypically), an update takes the form "add a quantity A to the label of 

every tree in the subtree rooted at u." Each update takes O(lgn) amortized time. In addition, searching for 

a node v that has /i(u) < takes O(lgri) time. 

Forward(G,S,P): 
initialize T := Tq 

for every node v, initialize /i(w) :— oo 
for i = 0, 1, 2, ... , s: 

comment: T is rooted at pi 
•k for each node v £ B such that either i{v) — i or /i(w) < , 

create a connection {pi,v) 
set /i(w) := e dv 
if i < s, 

comment: now change the root... 
remove parent edge of pi+i and add edge pi+ipt 
comment: now make the tree a shortest-path tree 
f for each edge uv in the sequence Ci+i, 

remove the current parent edge of w in T, and add uv 
I for every active node w in the w-rooted subtree of T, 

^{w) := fi{w) -\- Auv 

The overall number of iterations of the loop in Step ★ is the number of connections added. The overall 
number of iterations of the loop in Step f is at most the number of edges, which is 0(|G|). Step | can 
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be done using a single bulk update in 0(lg \G\) time. Consequently, the algorithm runs in time 0((|G| + 
number of connections) Ig |G|). 

Now we show that the algorithm selects a covering set of connections (and that the set is small). At each 
moment in the execution of the algorithm, for each node v such that fj,{v) is finite, let last(u) denote the 
node p oi P such that {p, v) was the most recently selected connection for v. 

The n invariant is: for every node v for which fi{v) is finite, 

/z(u) = edy — (dist(pi, last(w)) + dist(last(i;), w) — distT(pi, v)) (4) 

Note that dist(pi, last(w)) + dist(last(w), w) is the length of the path that goes from the current root pi to v 
via last(w). When this length becomes significantly longer than dist(p, u) (longer by ed^), ^(f) < so the 
node V is included in the loop in Step *, so the connection (pi, v) is added. This shows that the connections 
added by Forward and Backward cover each node v e B. 

To bound the number of connections, we follow Thorup in using the potential function (p^ = dist(ps, last(i'))+ 
dist(last(w), w). Suppose that, at some execution of Step fi{v) < 0, so dist(pi, last(w)) + dist(last(i'), w) — 
distxiPi, v) > edy. When a connection (pi, v) is then added, last(u) becomes pi, so the potential function ^Py 
is reduced by at least edy. 

Initially <Py = dist(ps,ft(w)) + dist(pj(„), ti). Throughout the phase, by the triangle inequality , <Py > 
dist(j)s,v). Again using the triangle inequality (and the fact that the graph is undirected), dist(ps,Pi(t))) < 
dist(ps,w) + dist(pi(„),z;), so <Py > dist{ps,v) > dist(ps,pj(„)) — dist(pi,ti). Thus the total amount of reduction 
in <Py is at most 2 dist(ps, w). Since each reduction is by at least edist(rj(^,') , w), the total number of reductions 
(number of connections added by Forward after the initial one) is at most [2e~^] . 

5 Approximate distance oracles for genus g graphs 

Theorem 2. For any undirected graph G embedded in a surface of Euler genus g, there exists o (1 + 
e) -approximate distance oracle with query time 0(e~^(lgn + 5)^), linear space, and preprocessing time 
0{n{\gn){g^ + lgn)). The oracle can also he constructed in time 0{n{\gn){g/e + \gn)). 

Our distance oracle for genus g graphs is based on separating shortest paths, as for planar graphs (see 
Section 3.1). Thorup [Tho04] proves that any planar graph can be recursively separated by three shortest 
paths. Abraham and Gavoille [AG06] extend his result to minor-closed families, proving that any minor-free 
graph can be recursively separated by 0(1) shortest paths. Since bounded-genus graphs exclude minors, we 
could use their result to obtain a linear-space distance oracle. The constant in [AGOG] however depends on 
the size of the minor in an unspecified way. In the following, we prove that genus g graphs can be recursively 
separated using at most 0{g) shortest paths. In fact, only the first separator consists of at most 2g paths, 
while lower levels can be separated using 3 paths. These smaller separators allow us to derive approximate 
oracles and labeling schemes with a dependency on g that is much lower than the corresponding dependency 
in the more general construction by Abraham and Gavoille [AGOG] . More formally, we also prove the following. 

Theorem 3 (fast distance queries for genus g graphs). For any undirected graph G embedded in a 
surface of Euler genus g, there exists a (1 + e) -approximate distance oracle with query time 0{g/e), space 
0{n{g -\- \gn) I e) , and preprocessing time 0(n(lg n)'^e^^ + ?i(lgn)f;/e). The oracle can be distributed as a 
labeling scheme using 0{{g + lgr7,)/e) bits per node. 

In the following, we assume that G is embedded. 
5.1 Overview 

In the first step, we "cut" the genus g graph into planar subgraphs using the tree-cotree decomposition of 
Eppstein [Epp03], which decomposes a graph of genus g into planar graphs, separated by 2g paths from a 
tree T. We choose T to be a shortest-path tree. 
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Lemma 3 (Corollary of Eppstein [Epp03, Proof of Lemma 3.2]). Any graph G of genus g on n 
nodes and m edges can be divided into planar subgraphs by a separator that consists of at most 2g shortest 
paths. Furthermore, these paths can be computed using a single-source shortest-path search in G plus 0{gm) 
time. 

Proof. The existence of these paths is due to Eppstein's tree-cotree decomposition {T,G,X) of G, where T 
is a SSSP tree (any spanning tree works), G is its cotree, and X is the set of remaining edges of size at most 
2g [Epp03]. 

Starting at an arbitrary node u, we compute a single-source shortest path tree T and its dual G . The 
separator consists of at most 2g tree paths from u to Vi for each of the remaining edges (wo, vi) S X, and of 
the edges in X. □ 

At a high level, the theorems follow by combining Eppstein's lemma with the distance oracles for planar 
graphs (Thorup [Tho04] and Sections 3 and 4). Within the planar subgraphs, we use the distance oracles for 
planar graphs. In addition to computing the connections to the separator paths within each planar subgraph, 
we also need to compute the connections to the 0{g) tree-cotree decomposition paths. Note that the latter 
set of connections consists of paths that may pass through non-planar parts. To compute these, we may use 
either [CC07] or [Tho04, Lemma 3.12], depending on the values of g and e. 

5.2 Preprocessing algorithm 

Within planar subgraphs To obtain the preprocessing and space bounds in Theorem 2, we use the pre- 
processing algorithm described in Section 4 with r :— P ^ where I = 0(e^^(lgn + g)). Since the number 
of connections per node is proportional to £ and since a 1/ -yr-fraction of the nodes per subgraph lies on 
the boundary, the overall space consumption is linear. To obtain the preprocessing and space bounds in 
Theorem 3, we use Thorup's algorithm [Tho04, Thm. 3.19] for 0(l/e) query time. 

Connections to tree-cotree separator There are two options to compute these connections: (1) We may 
use [Tho04, Lemma 3.12] (which internally uses Thorup's 0(m) SSSP algorithm [Tho99,Tho00]). The lemma 
states that, for a path Q, we can compute an e-covering set C{v, Q) for all nodes v in time 0(e~^n(lgn)). (2) 
We may use the MSSP data structure for genus g graphs by Cabello and Chambers [CC07], which requires 
0{g'^n\gn) preprocessing and then answers queries in time O(lgn). See planar preprocessing (Section 4) 
for details. The time required is 0{g^n\gn -\- number of connections • Ign) = 0{g'^n\gn). We apply either 
lemma for the at most 2g paths of the tree-cotree decomposition. (For Theorem 3, the first option gives 
faster asymptotic preprocessing time; for Theorem 2, the optimal choice depends on e and g.) 

5.3 Query algorithm 

At query time, we can essentially use the same algorithm as for the planar case (Section 3.2 and [Tho04, 
Thm. 3.19]). The only difference to the planar case is that we also need to include the at most 2g paths 
separating the genus graph into planar subgraphs. To obtain the bound on the query time in Theorem 2, 
note that computing connections through these < 2g separating paths can be done in time Oiigje) and that 
exploring both regions took time 0(£^) (where i = 0(e^^(lgn + .g))). 

6 Linear-space approximate distance oracle for i/— minor-free graphs 

Theorem 4. For any minor H there is an integer h = h{F[) such that for any undirected H -minor-free graph 
G with n nodes and m edges there exists a (1 -\- e) -approximate distance oracle with query time 0{ht~'^ Ig^ n), 
space 0{m), and polynomial preprocessing time. 
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The proof of Theorem 4 is structurally the same as for the planar case. We again use r-divisions [Fre87], 
this time tailored to minor- free graphs using a separator algorithm by Kawarabayashi and Reed [KRIO]^. 

An r-division of an 7J-minor-free graph is a division into 0{n/r) regions of 0{r) vertices each and 
0(|iJ| \/r) boundary vertices each. (Note that the boundary is larger by a factor \H\ compared to the 
boundary in the planar case.) 

Lemma 4 (Frederickson [Pre87], combined with Kawarabayashi and Reed [KRIO]). An H -minor- 
free graph on n vertices can be divided into an r-division in 0{n^ ^gn) time. 

Proof (of Theorem 4-)- The proof is a combination of three techniques: (i) Lemma 4, (ii) shortest path 
separators in minor- free graphs by Abraham and Gavoille [AGOG], and (iii) Thorup's e-covers (as described 
in Section 3.1). 

Abraham and Gavoille [AG06] prove that any if-minor-free graph can be recursively separated by k{H) 
shortest paths and that these paths can be found in polynomial time. We first compute these separator paths. 
We then compute an r-division for r := {£ ■ \H\)'^ as in Lemma 4. For all the nodes on the boundary, we 
compute connections to these shortest paths [Tho04, Lemma 3.12] (as in Section 5.2). We store £ — 0{e~^ Ig n) 
connections per node (where i depends on \H\ in an unspecified way) for 0{n/£) nodes. The total space 
requirement is thus 0{m) (without further dependencies on H). 

At query time, given a pair (s,i), we first explore both regions Rs and Rt, respectively, using an SSSP 
search [TMH09] until all the boundary nodes of these two regions have been found. This step requires time 
0(£^) (hiding further dependencies on H stemming from [TMH09]). As described in Section 3.2, we can then 
merge clean and ordered covers in linear time [Tho04, Lemma 3.6]. □ 

7 Linear-space approximate distance oracle for unit-length graphs with 
bounded doubling dimension 

The distance oracles for planar, bounded-genus, and minor-free graphs heavily used the notion of separators. 
In the following, we show that separators are not the only way to obtain linear-space approximate distance 
oracles. Our linear-space approximate distance oracle for bounded-doubling-dimension graphs exploits the 
bounded-growth property. 

The aspect ratio (also known as spread) of P C V, denoted by ^{P), is the ratio of the diameter of P 
and the distance between the closest pair of nodes in P. It is well known that for any A-doubling metric Ai , 
any set of points P C M. with aspect ratio at most D > A{P) satisfies \P\ < A'^^'^^')^ 

We use this fact to obtain the following. 

Theorem 5. For any unit-length graph G ~ (V, E) on n — \V\ nodes and m = \E\ edges with doubling 
dimension a and diameter A, and for any e > there exists a (1 + e) -approximate distance oracle using 
space 0{m) and query time (IgA)'-"^"^ ■ (l/e)'^^" \ 

Note that (1) for unit-length graphs A = 0{n) and thus the query time for constant a is 0{poly{\gn, 1/e)) 
and (2) our result also holds for unweighted geometric graphs such as those considered in [GLNS08]. Our 
approach extends to graphs with moderate edge lengths; the dependency on the largest weight is however 
polynomial and not logarithmic. 

In our proof we use the following approximate distance labeling scheme. 

Lemma 5 (Har-Peled and Mendel [HPM06, Proposition 6.10]). For a metric with doubling dimen- 
sion a and aspect ratio A — A{V) and for any e > 0, there exists a (1 + e) -approximate distance labeling 
scheme with label length (l/e)'^^"'' IgA bits per node and query time 2'-"--°'\ 

^ Alternatively, Frederickson's algorithm could be combined with the separator algorithm by Alon, Seymour, and 
Thomas [AST94] to obtain an r-division with 0{\H\'^''^ ^/r) boundary vertices per region or with the separator 
algorithm by Reed and Wood [RW09] for 0{\Hf^'' 2^^'^^'+'^^^'^r^/'-^) boundary vertices per region. See also [TMH09] 
for such modifications. 
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Proof (of Theorem 5). Let 5 < n be an integer. A S -dominating set of a graph G — {V, E) is a subset L CV 
of nodes such that for each v E V there is a node Z S L at distance at most S. It is well-known that there is 
a 5-dominating set L of size at most \L\ < n/{S + 1) and that such a set L can be found efficiently [KP98]. 

Let G — {V, E) be a graph that allows for a (1 + e)-approximate distance labeling scheme with label 
length I — (l/e)*^'"Mg Z\ (Lemma 5). We store the distance labels for the nodes of an ^-dominating set, 
which requires total space 0{n). For each unlabeled node u, we also store its nearest labeled node l{v). 

At query time, given s,t € V, we distinguish between "close" pairs (distance at most l/e) and "far" 
pairs (otherwise). For "close" pairs, we explore (using BFS) the ball B{s) of radius d = 0{i/e) around s. If 
t € i?(s), the exact distance can be returned (a "close" pair). Recall that for any A-doubling metric A^, any 
set of points P C with aspect ratio at most D > A{P) satisfies \P\ < X'^(^s^\ For a unit-length graph 
G — {V, E) with doubling dimension a — lg2 A, for any node v E V the number of nodes within distance d 
satisfies 

\{u:dG{u,v) <d}\ < A°(^sd). 

The number of edges within B{s) is at most quadratic in the number of nodes. Exploring B[s) using BFS 
thus requires time proportional to A'^'-'^'^^. For "far" pairs, we triangulate using l{s) and Z(<), returning an 
approximate distance: The algorithm returns d{u,v) — dG{u,l{u)) +'D{C{l{u)), C{l{v))) + dGiliv),v), where 
£.{w) denotes the label of w and 2?(-,-) denotes the decoding function of the labeling scheme. A simple 
calculation (using the triangle inequality and the fact that the distance d{u, v) is at least £/e) yields that the 
query result d{u, v) satisfies d{u, v) < {1 + 7e)d{u, v) (for any e G (0, 1]). □ 
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